Shielding in Resource-Constrained Goal POMDPs

نویسندگان

چکیده

We consider partially observable Markov decision processes (POMDPs) modeling an agent that needs a supply of certain resource (e.g., electricity stored in batteries) to operate correctly. The is consumed by the agent's actions and can be replenished only states. aims minimize expected cost reaching some goal while preventing exhaustion, problem we call resource-constrained optimization (RSGO). take two-step approach RSGO problem. First, using formal methods techniques, design algorithm computing shield for given scenario: procedure observes prevents it from might eventually lead exhaustion. Second, augment POMCP heuristic search POMDP planning with our shields obtain solving implement present experiments showing its applicability benchmarks literature.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constrained Renewable Resource Allocation in Fuzzy Metagraphs via Min-Slack

This paper discusses that the fuzzy metagraphs can be used as a tool for scheduling and control of fuzzy projects. Often, available resources for executing projects may be limited. It is assumed the resources required to accomplish each activity of project (metagraph edges) is renewable. One of the common methods for scheduling projects is using the min-slack. So, first, the forward and bac...

متن کامل

Piecewise Linear Dynamic Programming for Constrained POMDPs

We describe an exact dynamic programming update for constrained partially observable Markov decision processes (CPOMDPs). State-of-the-art exact solution of unconstrained POMDPs relies on implicit enumeration of the vectors in the piecewise linear value function, and pruning operations to obtain a minimal representation of the updated value function. In dynamic programming for CPOMDPs, each vec...

متن کامل

Point-Based Value Iteration for Constrained POMDPs

Constrained partially observable Markov decision processes (CPOMDPs) extend the standard POMDPs by allowing the specification of constraints on some aspects of the policy in addition to the optimality objective for the value function. CPOMDPs have many practical advantages over standard POMDPs since they naturally model problems involving limited resource or multiple objectives. In this paper, ...

متن کامل

Exploiting Fully Observable and Deterministic Structures in Goal POMDPs

When parts of the states in a goal POMDP are fully observable and some actions are deterministic it is possible to take advantage of these properties to efficiently generate approximate solutions. Actions that deterministically affect the fully observable component of the world state can be abstracted away and combined into macro actions, permitting a planner to converge more quickly. This proc...

متن کامل

Reasoning with Constrained Goal Models

Goal models have been widely used in Computer Science to represent software requirements, business objectives, and design qualities. Existing goal modeling techniques, however, have shown limitations of expressiveness and/or tractability in coping with complex real-world problems. In this work we exploit advances in automated reasoning technologies, notably SMT solvers, to propose and formalize...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i12.26715